A non-linear optimization procedure to estimate distances and instantaneous substitution rate matrices under the GTR model

نویسندگان

  • Daniele Catanzaro
  • Raffaele Pesenti
  • Michel C. Milinkovitch
چکیده

MOTIVATION The general-time-reversible (GTR) model is one of the most popular models of nucleotide substitution because it constitutes a good trade-off between mathematical tractability and biological reality. However, when it is applied for inferring evolutionary distances and/or instantaneous rate matrices, the GTR model seems more prone to inapplicability than more restrictive time-reversible models. Although it has been previously noted that the causes for intractability are caused by the impossibility of computing the logarithm of a matrix characterised by negative eigenvalues, the issue has not been investigated further. RESULTS Here, we formally characterize the mathematical conditions, and discuss their biological interpretation, which lead to the inapplicability of the GTR model. We investigate the relations between, on one hand, the occurrence of negative eigenvalues and, on the other hand, both sequence length and sequence divergence. We then propose a possible re-formulation of previous procedures in terms of a non-linear optimization problem. We analytically investigate the effect of our approach on the estimated evolutionary distances and transition probability matrix. Finally, we provide an analysis on the goodness of the solution we propose. A numerical example is discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessing the Applicability of the GTR Nucleotide Substitution Model Through Simulations

The General Time Reversible (GTR) model of nucleotide substitution is at the core of many distance-based and character-based phylogeny inference methods. The procedure described by Waddell and Steel (1997), for estimating distances and instantaneous substitution rate matrices, R, under the GTR model, is known to be inapplicable under some conditions, ie, it leads to the inapplicability of the G...

متن کامل

Finding the Best Coefficients in the Objective Function of a Linear Quadratic Control Problem

Finding the best weights of the state variables and the control variables in the objective function of a linear-quadratic control problem is considered. The weights of these variables are considered as two diagonal matrices with appropriate size and so the objective function of the control problem becomes a function of the diagonal elements of these matrices. The optimization problem which is d...

متن کامل

MULTI-OBJECTIVE OPTIMAL DESIGN OF SATMD INCLUDING SOIL-STRUCTURE INTERACTION USING NSGA-II

In this paper, a procedure has been introduced to the multi-objective optimal design of semi-active tuned mass dampers (SATMDs) with variable stiffness for nonlinear structures considering soil-structure interaction under multiple earthquakes. Three bi-objective optimization problems have been defined by considering the mean of maximum inter-story drift as safety criterion of structural compone...

متن کامل

Development of a site-specific regression model for assessment of road-header cutting performance of Tabas coal mine based on rock properties

In underground excavation, where the road-headers are employed, a precise prediction of the road-header performance has a vital role in the economy of the project. In this paper, a new model is developed for prediction of the road-header performance using the non-linear multivariate regression analysis. This model is able to estimate the instantaneous cutting rate (ICR) of roadheader based on r...

متن کامل

The optimal energy carriers substitutes in thermal power plants:A fuzzy linear programming model

In this paper, a dynamic optimization approach for optimal choice of energy carriers in thermal power plants is proposed that analyzes the substitution of energy carriers in short-term planning of a power plant.The model is based on the linear programming method with the objective of minimizing costs under constraints of resource availability, energy balances, environmental regulations and elec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 22 6  شماره 

صفحات  -

تاریخ انتشار 2006